A Reflection on the Structure and Process of the Web of Data

Author

  • Marko A. Rodriguez
Abstract

Marko A. Rodriguez is director's fellow at the Center for Nonlinear Studies, T-7, Los Alamos National Laboratory, Los Alamos, New Mexico 87545; Markolanl.gov

The web community has introduced a set of standards and technologies for representing, querying and manipulating a globally distributed data structure known as the Web of Data. The proponents of the Web of Data envision much of the world's data being interrelated and openly accessible for use by the general public. This vision is analogous in many ways to the familiar web of documents, but instead of making documents and media openly accessible, the focus is on making data openly accessible. Providing data for public use has stimulated interest in a movement dubbed Open Data [1]. Open Data is analogous in many ways to the open source movement. However, instead of focusing on software, Open Data is focused on the legal and licensing issues around publicly exposed data. Together, various technological and legal tools are laying the groundwork for the future of global-scale data management on the web.

As of today, in its early form, the Web of Data hosts a variety of data sets that include encyclopedic facts, drug and protein data, metadata on music, books and scholarly articles, social network representations, geospatial information and many other types of information. The size and diversity of the Web of Data demonstrate the flexibility of the underlying standards and the overall feasibility of the project as a whole. The purpose of this article is to provide a review of the technological underpinnings of the Web of Data as well as some of the hurdles that need to be overcome if the Web of Data is to emerge as the de facto medium for data representation, distribution and, ultimately, processing.

Technically, on the Web of Data, Uniform Resource Identifiers (URIs) are used to identify resources [2]. For example, depending on what is being modeled, a URI can denote a city, a protein, a music album, a scholarly article or a person. In fact, in general anything can be assigned a URI. An example URI is . This URI denotes the author of this article, Marko. The URI has information pertaining to the what (marko), where (www.lanl.gov) and how (http) of a resource. The URI is more general than the commonly used URL, as URIs are not required to resolve to retrievable digital objects such as documents and media. Instead, URIs can denote abstract concepts such as the person Marko, the class of dogs or the notion of friendship. Finally, the space of all URIs is inherently distributed and theoretically infinite. This attribute makes the URI space fit to represent massive amounts of data distributed worldwide. A convenient consequence is that the Web of Data can emerge atop this space.

However, while URIs can denote things, they cannot denote how things relate to each other. Relating URIs is necessary in order to give greater meaning and context to each datum. Moreover, relating URIs is necessary to create the web aspect of the Web of Data. The Resource Description Framework (RDF) is a standardized data model for linking URIs in order to create a network/graph over the space of all URIs [3]. RDF also supports the linking of URIs to primitive literals such as strings, integers or floating point values. An example RDF statement to denote the fact that “Marko knows Fluffy” is . In order to make long URIs more readable, namespace prefixes are generally used. With namespace prefixes, the previous statement can be represented as lanl:marko, foaf:knows, lanl:fluffy. All RDF statements have this three...
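The what/where/how reading of a URI and the prefixed statement lanl:marko, foaf:knows, lanl:fluffy can be illustrated with a minimal Python sketch. It rests on stated assumptions: the full example URIs are missing from the text above, so the `lanl` namespace `http://www.lanl.gov#` is a hypothetical stand-in chosen only to match the parts the text names (marko, www.lanl.gov, http), and the helper names `expand` and `describe` are invented for illustration. The `foaf` namespace is the published FOAF one. An RDF statement is modeled here as a plain (subject, predicate, object) tuple, which is the standard shape of an RDF triple.

```python
from urllib.parse import urlsplit

# Prefix table. The foaf namespace is the published FOAF namespace;
# the lanl namespace is a hypothetical stand-in, not taken from the source.
PREFIXES = {
    "lanl": "http://www.lanl.gov#",
    "foaf": "http://xmlns.com/foaf/0.1/",
}


def expand(prefixed_name):
    """Expand a prefixed name such as 'foaf:knows' into a full URI."""
    prefix, local = prefixed_name.split(":", 1)
    return PREFIXES[prefix] + local


def describe(uri):
    """Split a URI into its how (scheme), where (host) and what (fragment)."""
    parts = urlsplit(uri)
    return {"how": parts.scheme, "where": parts.netloc, "what": parts.fragment}


# One RDF statement: a (subject, predicate, object) triple of full URIs.
statement = tuple(
    expand(name) for name in ("lanl:marko", "foaf:knows", "lanl:fluffy")
)

# A collection of such triples forms a graph whose nodes and edge labels are URIs.
graph = {statement}

if __name__ == "__main__":
    print(describe(statement[0]))
    for term in statement:
        print(term)
```

Keeping the graph as a set of tuples is only one possible representation; it is used here because it makes the link between "a statement" and "an edge in a network of URIs" visible without any RDF library.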

Journal title:
  • CoRR

Volume abs/0908.0373

Publication date 2009